Introduction

Using our originally-pulled corpus of around 7,000 cases, I used Jimmy’s dictionary of terms to analyze it both as a complete set and on an individual-term basis.

Here is the raw dictionary of terms we looked for; for reference, terms were stemmed and lemmatized so that variants of the same term or phrase could be considered as one. Additionally, I removed stop words from all terms (common, non-substantive words) for ease of analysis with the full corpus.

Original versus Stemmed Dictionary Terms
Original Term Stemmed Term
educational opportunity educ opportun
equal opportunity equal opportun
objective object
intangible intang
tangible tangibl
engage with engag
exchange views exchang view
learn his profession learn profess
feeling of inferiority  feel inferior
psychological psycholog
substantial equality substanti equal
prestige prestig
alumni alumni
substantial equality substanti equal
reputation reput
standing in the community stand commun
traditions tradit
intellectual commingling intellectu commingl
aspirations aspir
control their own destinies control destini
self image self imag
lower expectations lower expect
stigma stigma
sociological sociolog
less measurable less measur
oppportunity to compete oppportun compet
antisocial attitudes and behavior antisoci attitud behavior
made equal made equal
melting pot melt pot
diversity divers
multiracial society multiraci societi
social skills social skill
non quantitative factor non quantit factor
integrated educational experience integr educ experi
participate fully particip fulli
bi racial community bi racial commun
meaningful integration meaning integr
artificial advantage artifici advantag
self perception self percept
attitudinal effects attitudin effect
grapple grappl
realistic attitudes realist attitud
negativism negativ
play together and interact plai togeth interact
get to know one another get know on anoth
respect the other s differences respect other' differ
tolerate each other toler
more complex complex
access access
enclave enclav
larger society larger societi
determine success determin success
majority culture major cultur
two societies two societi
mainstream of our society mainstream societi
different world differ world
self image self imag
greater contacts greater contact
greater understanding greater understand
notwithstanding such equality notwithstand equal
associate with children associ children
co mingle co mingl
extent of desegregation extent desegreg
social equality social equal
social matters social matter
conduit conduit
encounter of students encount student
wide exposure wide exposur
reputation of teachers reput teacher
apartheid apartheid
ability to learn abil learn
precise measurement precis measur
intangible vestiges intang vestig
socioeconomic class consciousness socioeconom class conscious
social scientific factors social scientif factor
characteristics for future success characterist futur success
preparing for meeting life prepar meet life
no known yardstick known yardstick
difficult to measure difficult measur
beauty of the campus beauti campu
develop relationships develop relationship
dominant class domin class
opportunity networks opportun network
white social networks white social network
old boy networks old boi network
higher standing higher stand
folkways folkwai
perceptions of the white majority percept white major
accredidation accredid
alumni alumni
standing of the institution stand institut
status statu
determine success determin success
outside the classroom outsid classroom
relations of men to one another relat men on anoth
positions of influence and power posit influenc power
form acquaintance form acquaint
dominant figures domin figur
to know and to be known know known
confine his association confin associ
being equalized equal
student interaction student interact
environment of a multi racial community environ multi racial commun
cultural value cultur valu
assimilation assimil
influence influenc
affluence affluenc
jealously guard jealous guard
traditionally closed tradition close
alumni certificate alumni certif
traditional society barriers tradit societi barrier
social pattern social pattern
personally acquainted person acquaint
pale shadow pale shadow
stature statur

Full Dictionary over Time

Next, I wrote a customized script to track the occurrences of variable-length phrases in a corpus (amazingly, this did not appear to exist in any package or Stack Overflow post I could find). I ran it with our dictionary and the full relevant corpus (specified as 1950-1974 per Jimmy’s specifications).

First, I did just a raw count of the dictionary occurring as a set over time:

## Warning: Removed 1 row(s) containing missing values (geom_path).

However, this raw count doesn’t paint the full picture, so I also did a relative count of terms. This is simply the percentage of terms in a given year that were terms from our dictionary.

Overall Rankings

Next, I ran some quick rankings. First up is a ranking of terms by overall counts:

Top Terms
Term Total Uses of Dictionary Terms
equal 5003
substanti equal 2360
object 2171
made equal 1662
equal opportun 1466
educ opportun 1424
engag 1090
determin success 982
statu 932
access 637
less measur 575
tradit 559
stand commun 518
stand institut 493
complex 489
social equal 485
social matter 453
associ children 447
extent desegreg 435
influenc 426
two societi 411
develop relationship 356
difficult measur 355
particip fulli 345
psycholog 314
know known 308
higher stand 291
greater understand 290
confin associ 287
precis measur 287
lower expect 272
social pattern 244
abil learn 236
divers 227
notwithstand equal 224
reput 216
toler 205
differ world 198
greater contact 170
feel inferior 158
major cultur 150
exchang view 149
meaning integr 144
cultur valu 129
outsid classroom 126
domin class 125
social skill 116
tradition close 113
larger societi 111
domin figur 103
learn profess 88
tangibl 85
alumni 84
realist attitud 80
encount student 75
self imag 72
person acquaint 62
form acquaint 60
reput teacher 60
integr educ experi 59
artifici advantag 55
posit influenc power 55
sociolog 55
prepar meet life 54
intang 47
prestig 45
self percept 43
aspir 42
wide exposur 42
student interact 41
stigma 39
characterist futur success 33
alumni certif 26
assimil 21
opportun network 20
conduit 19
tradit societi barrier 19
intellectu commingl 18
co mingl 17
intang vestig 17
bi racial commun 16
known yardstick 16
statur 16
enclav 15
mainstream societi 15
beauti campu 13
grappl 13
control destini 12
melt pot 9
affluenc 8
get know on anoth 8
pale shadow 8
social scientif factor 8
negativ 7
plai togeth interact 6
jealous guard 5
multiraci societi 5
percept white major 5
apartheid 4
non quantit factor 2
old boi network 2
relat men on anoth 2
socioeconom class conscious 2
white social network 2
accredid 1
antisoci attitud behavior 1
attitudin effect 1
folkwai 1
oppportun compet 1

Then, the top 25 cases with the highest dictionary term counts:

Top Cases - By Raw Counts
Case Total Uses in Corpus
hobson-v-hansen 905
united-states-v-morgan 575
san-antonio-independent-school-dist-v-rodriguez 574
oliver-v-kalamazoo-board-of-education 546
oregon-v-mitchell 384
hart-v-community-sch-bd-of-brooklyn-ny-sch-d-21 366
keyes-v-school-district-number-one-denver-colorado 329
baker-v-carr 315
swann-v-charlotte-mecklenburg-board-of-education 294
united-states-v-ei-du-pont-de-nemours-co 288
united-states-v-state-of-texas 275
morales-v-turman 256
telex-corp-v-international-business-machines-corp 242
united-states-v-hk-porter-company 223
abington-school-dist-v-schempp 221
briggs-v-elliott 220
gertz-v-robert-welch-inc 218
higgins-v-board-of-education-grand-rapids-mich 207
keyes-v-school-district-no-1-denver-colorado 195
stell-v-savannah-chatham-county-board-of-education 191
pennsylvania-assn-retd-child-v-commonwealth-of-pa 186
graves-v-barnes 181
stamps-v-detroit-edison-co 176
beer-v-united-states 175
bradley-v-milliken 175

Then, the top 25 most frequent cases in terms of dictionary richness (percentage of dictionary words in its overall word count):

Top Cases - By Dictionary Richness
Case Percent of Dictionary Terms in Case
hobson-v-hansen 19.925143
briggs-v-elliott 16.897081
briggs-v-elliott 14.369693
bush-v-orleans-parish-school-board 13.190731
bush-v-orleans-parish-school-board 12.542373
swann-v-charlotte-mecklenburg-board-of-education 10.414453
mcswain-v-county-board-of-education 9.921671
swann-v-charlotte-mecklenburg-board-of-education 9.767442
hart-v-community-sch-bd-of-brooklyn-ny-sch-d-21 9.334353
cisneros-v-corpus-christi-independent-school-dist 8.982512
lee-v-macon-county-board-of-education 7.285651
swann-v-charlotte-mecklenburg-board-of-education 7.158510
swann-v-charlotte-mecklenburg-board-of-education 7.123819
swann-v-charlotte-mecklenburg-board-of-education 6.571301
hobson-v-hansen 6.464747
keyes-v-school-district-number-one-denver-colorado 6.303890
keyes-v-school-district-number-one-denver-colorado 5.895001
norwalk-core-v-norwalk-board-of-education 5.812221
keyes-v-school-district-number-one-denver-colorado 5.746725
hobson-v-hansen 4.824351
oliver-v-kalamazoo-board-of-education 4.465892
moses-v-washington-parish-school-board 4.419036
evans-v-buchanan 3.761419
morales-v-turman 3.670251
united-states-v-state-of-texas 3.661784

Next, the top 25 courts with the highest dictionary term counts:

Top Courts Using Dictionary Terms - By Raw Counts
Court Total Uses of Dictionary Terms
scotus 4099
nysd 2085
dcd 1756
laed 1119
paed 1077
vaed 1068
ilnd 1011
miwd 969
txsd 799
almd 788
nyed 786
mied 722
mdd 665
txed 577
alnd 571
cod 565
ded 555
ncwd 497
ared 461
pawd 443
cand 441
txwd 430
mssd 394
gand 379
southcarolinaed 359

And the top 25 courts in terms of dictionary richness (percentage of dictionary words across all of a court’s text):

Top Courts - By Dictionary Richness
Court Percent of Dictionary Terms for all Cases
nmd 1.5140045
kyed 0.8928571
miwd 0.8294316
sdd 0.8205128
nyed 0.8038125
utd 0.8011653
nvd 0.7970849
caed 0.7892204
ctd 0.7256058
wied 0.7232585
southcarolinaed 0.7017750
gasd 0.6936031
nhd 0.6800151
ord 0.6653897
pamd 0.6603588
txed 0.6437002
ohnd 0.6401915
prd 0.6311993
cod 0.6298142
wiwd 0.6246042
tned 0.6173435
dcd 0.5977893
scotus 0.5949936
oked 0.5920079
cand 0.5735541

Individual Term Analyses

And finally, here are graphs for each individual word - both raw and relative counts.

## Warning: Removed 4 row(s) containing missing values (geom_path).

## Warning: Removed 1 row(s) containing missing values (geom_path).

## Warning: Removed 8 row(s) containing missing values (geom_path).

## Warning: Removed 10 row(s) containing missing values (geom_path).

## Warning: Removed 1 row(s) containing missing values (geom_path).

## Warning: Removed 4 row(s) containing missing values (geom_path).

## Warning: Removed 24 row(s) containing missing values (geom_path).

## Warning: Removed 24 row(s) containing missing values (geom_path).

## Warning: Removed 3 row(s) containing missing values (geom_path).

## Warning: Removed 17 row(s) containing missing values (geom_path).

## Warning: Removed 3 row(s) containing missing values (geom_path).

## Warning: Removed 20 row(s) containing missing values (geom_path).

## Warning: Removed 8 row(s) containing missing values (geom_path).

## Warning: Removed 13 row(s) containing missing values (geom_path).

## Warning: Removed 8 row(s) containing missing values (geom_path).

## Warning: Removed 2 row(s) containing missing values (geom_path).

## Warning: Removed 24 row(s) containing missing values (geom_path).

## Warning: Removed 17 row(s) containing missing values (geom_path).

## Warning: Removed 1 row(s) containing missing values (geom_path).

## Warning: Removed 15 row(s) containing missing values (geom_path).

## Warning: Removed 11 row(s) containing missing values (geom_path).

## Warning: Removed 2 row(s) containing missing values (geom_path).

## Warning: Removed 25 row(s) containing missing values (geom_path).

## Warning: Removed 15 row(s) containing missing values (geom_path).

## Warning: Removed 14 row(s) containing missing values (geom_path).

## Warning: Removed 3 row(s) containing missing values (geom_path).

## Warning: Removed 5 row(s) containing missing values (geom_path).

## Warning: Removed 18 row(s) containing missing values (geom_path).

## Warning: Removed 1 row(s) containing missing values (geom_path).

## Warning: Removed 1 row(s) containing missing values (geom_path).

## Warning: Removed 24 row(s) containing missing values (geom_path).

## Warning: Removed 7 row(s) containing missing values (geom_path).

## Warning: Removed 3 row(s) containing missing values (geom_path).

## Warning: Removed 1 row(s) containing missing values (geom_path).

## Warning: Removed 5 row(s) containing missing values (geom_path).

## Warning: Removed 1 row(s) containing missing values (geom_path).

## Warning: Removed 3 row(s) containing missing values (geom_path).

## Warning: Removed 23 row(s) containing missing values (geom_path).

## Warning: Removed 20 row(s) containing missing values (geom_path).

## Warning: Removed 24 row(s) containing missing values (geom_path).

## Warning: Removed 14 row(s) containing missing values (geom_path).

## Warning: Removed 25 row(s) containing missing values (geom_path).

## Warning: Removed 18 row(s) containing missing values (geom_path).

## Warning: Removed 3 row(s) containing missing values (geom_path).

## Warning: Removed 25 row(s) containing missing values (geom_path).

## Warning: Removed 2 row(s) containing missing values (geom_path).

## Warning: Removed 18 row(s) containing missing values (geom_path).

## Warning: Removed 16 row(s) containing missing values (geom_path).

## Warning: Removed 5 row(s) containing missing values (geom_path).

## Warning: Removed 5 row(s) containing missing values (geom_path).

## Warning: Removed 1 row(s) containing missing values (geom_path).

## Warning: Removed 13 row(s) containing missing values (geom_path).

## Warning: Removed 11 row(s) containing missing values (geom_path).